Review for NeurIPS paper: Security Analysis of Safe and Seldonian Reinforcement Learning Algorithms

Neural Information Processing Systems

Weaknesses: W1: The study seems to focus too much on algorithms that are based on safety tests. I understand that the analysis may not carry over directly, but it might be worthwhile to also include a study of how easy it is to trick those algorithms. More generally (even for IS algorithms), it was a bit odd to me that the study does not consider attacks on the way pi_e is chosen. W2: It is unclear to me whether the trajectory must still have been performed in the real environment, or whether it can be completely made up (in which case its value only has to lie within the range [0,1]). Also, with model-based methods (for both environment and policy models), it might be possible to single out the few trajectories that are inconsistent with the other trajectories.


Review for NeurIPS paper: Security Analysis of Safe and Seldonian Reinforcement Learning Algorithms

Neural Information Processing Systems

All the reviewers support acceptance based on the contributions, notably the improvements to the robustness of RL algorithms against adversarial attacks and a clear exposition of how these methods can be applied to real-world problems. Please consider revising the paper to address the concerns raised in the reviews and rebuttal, in particular to better explain the scope of the work. Separately, it may be useful to extend the broader impact statement to inform a casual reader that a mathematical safety guarantee on an algorithm is not a replacement for domain-specific safety requirements (for example, the diabetes treatment would still need oversight for medical safety).


Security Analysis of Safe and Seldonian Reinforcement Learning Algorithms

Neural Information Processing Systems

We analyze the extent to which existing methods rely on accurate training data for a specific class of reinforcement learning (RL) algorithms, known as Safe and Seldonian RL. We introduce a new measure of security that quantifies susceptibility to perturbations in training data by defining an attacker model representing a worst-case analysis, and show that existing Seldonian RL methods are extremely sensitive to even a few data corruptions. We then introduce a new algorithm that is more robust against data corruptions, and demonstrate its use on several RL problems, including a grid-world and a diabetes treatment simulation.
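To make the vulnerability concrete, here is a minimal illustrative sketch (not the paper's actual algorithm) of a Seldonian-style safety test: a candidate policy is deployed only if a high-confidence lower bound on its estimated performance exceeds a threshold. The one-sided Hoeffding bound for returns in [0, 1] and all numbers below are assumptions chosen for illustration; the sketch shows how corrupting only a handful of trajectory returns can flip the test's outcome.

```python
import math

def safety_test(returns, performance_threshold, delta=0.05):
    """Illustrative Seldonian-style safety test (hypothetical sketch).

    Deploy the candidate policy only if a (1 - delta) confidence lower
    bound on its mean return exceeds the threshold. A simple one-sided
    Hoeffding bound is used here, assuming returns lie in [0, 1].
    """
    n = len(returns)
    mean = sum(returns) / n
    # One-sided Hoeffding lower bound for bounded returns.
    lower_bound = mean - math.sqrt(math.log(1.0 / delta) / (2.0 * n))
    return lower_bound >= performance_threshold

# Clean data: 100 trajectories, each with (importance-weighted) return 0.8.
clean = [0.8] * 100
print(safety_test(clean, performance_threshold=0.6))      # test passes

# Worst-case attacker: corrupt just 10 of the 100 returns down to the
# minimum value 0.0, causing a policy that passed on clean data to fail.
# (Pushing returns up to 1.0 could conversely make an unsafe policy pass.)
corrupted = [0.0] * 10 + clean[10:]
print(safety_test(corrupted, performance_threshold=0.6))  # test fails
```

This mirrors the attacker model in spirit: because the test aggregates bounded per-trajectory returns, an adversary who can fabricate or perturb even a small fraction of the training data can move the confidence bound across the decision threshold.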